Reputation-based Worker Filtering in Crowdsourcing
Abstract
In this paper, we study the problem of aggregating noisy labels from crowd workers to infer the underlying true labels of binary tasks. Unlike most prior work which has examined this problem under the random worker paradigm, we consider a much broader class of adversarial workers with no specific assumptions on their labeling strategy. Our key contribution is the design of a computationally efficient reputation algorithm to identify and filter out these adversarial workers in crowdsourcing systems. Our algorithm uses the concept of optimal semi-matchings in conjunction with worker penalties based on label disagreements, to assign a reputation score for every worker. We provide strong theoretical guarantees for deterministic adversarial strategies as well as the extreme case of sophisticated adversaries where we analyze the worst-case behavior of our algorithm. Finally, we show that our reputation algorithm can significantly improve the accuracy of existing label aggregation algorithms in real-world crowdsourcing datasets.
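As a rough illustration of the filter-then-aggregate pipeline described above, the following Python sketch scores each worker by how often co-labelers disagree with them, drops the highest-penalty workers, and takes a majority vote over the rest. This is not the paper's algorithm: the abstract does not specify the penalty rule, and the optimal semi-matching step is omitted entirely. The function names, the averaging penalty, and the `num_to_drop` parameter are all assumptions made for illustration.

```python
from collections import defaultdict


def disagreement_penalties(labels):
    """Hypothetical penalty: mean fraction of co-labelers who disagree.

    labels maps (worker, task) -> +1 or -1.
    Returns a dict worker -> penalty in [0, 1].
    """
    by_task = defaultdict(dict)
    for (worker, task), label in labels.items():
        by_task[task][worker] = label

    totals = defaultdict(float)
    counts = defaultdict(int)
    for votes in by_task.values():
        if len(votes) < 2:
            continue  # no one to disagree with on this task
        for worker, label in votes.items():
            others = [v for w, v in votes.items() if w != worker]
            disagree = sum(1 for v in others if v != label)
            totals[worker] += disagree / len(others)
            counts[worker] += 1
    return {w: totals[w] / counts[w] for w in counts}


def filter_and_vote(labels, num_to_drop):
    """Drop the num_to_drop highest-penalty workers, then majority-vote.

    Ties in the vote are resolved to +1 (an arbitrary choice).
    Returns (task -> inferred label, set of dropped workers).
    """
    penalties = disagreement_penalties(labels)
    ranked = sorted(penalties, key=penalties.get, reverse=True)
    dropped = set(ranked[:num_to_drop])

    sums = defaultdict(int)
    for (worker, task), label in labels.items():
        if worker not in dropped:
            sums[task] += label
    inferred = {t: (1 if s >= 0 else -1) for t, s in sums.items()}
    return inferred, dropped


# Toy data: workers a, b, c label correctly; worker z flips every label.
truth = {"t1": 1, "t2": -1, "t3": 1}
labels = {}
for task, y in truth.items():
    for worker in ("a", "b", "c"):
        labels[(worker, task)] = y
    labels[("z", task)] = -y

inferred, dropped = filter_and_vote(labels, num_to_drop=1)
```

A plain disagreement count like this can be gamed by colluding adversaries who agree with each other, which is presumably part of why the paper combines penalties with optimal semi-matchings.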
Similar resources
An Analytic Approach to People Evaluation in Crowdsourcing Systems
Worker selection is a significant and challenging issue in crowdsourcing systems. Such selection is usually based on an assessment of the reputation of the individual workers participating in such systems. However, assessing the credibility and adequacy of such calculated reputation is a real challenge. In this paper, we propose an analytic model which leverages the values of the tasks complete...
Identifying Unreliable and Adversarial Workers in Crowdsourced Labeling Tasks
In this paper, we study the problem of aggregating noisy responses from crowd workers to infer the unknown true labels of binary tasks. Unlike most prior work which has examined this problem under the probabilistic worker paradigm, we consider a much broader class of adversarial workers with no specific assumptions on their labeling strategy. Our key contribution is the design of a computationa...
Reputation-based Worker Filtering in Crowdsourcing
A Proofs of the theorems. We first state a few helper lemmas. Lemma 1. Suppose the graph $G$ is an $(l, r)$-regular graph, i.e., worker degree is $l$ and task degree is $r$. Then, for each $(w_i, t_j) \in G$, the following is true: $\Pr(w_i(t_j) = +1) = \frac{1 + (2\gamma - 1)\mu}{2}$, $\Pr(w_i(t_j) = -1) = \frac{1 - (2\gamma - 1)\mu}{2}$, and $E[d_j^+] = r\,\frac{1 + (2\gamma - 1)\mu}{2}$, $E[d_j^-] = r\,\frac{1 - (2\gamma - 1)\mu}{2}$, where the probability and expectation are taken over th...
An incentive mechanism with privacy protection in mobile crowdsourcing systems
In order to improve the efficiency and utility of mobile crowdsourcing systems, this paper proposes an incentive mechanism with privacy protection in mobile crowdsourcing systems. Combining the advantages of offline incentive mechanisms and online incentive mechanisms, this paper proposes an incentive mechanism that selects the worker candidates statically, and then dynamically selects winners ...
Crowdworker Filtering with Support Vector Machine
Crowdsourcing has been recognized as a possible technique to complement costly user studies, usability studies, relevance judgment for information retrieval studies, and training set build-up for automatic document classification. However, the quality of crowdworkers varies by diverse factors and we often cannot tell whether their answers are right or wrong immediately due to the lack of gold s...
Publication date: 2014